R Barzilay, L Lee - Proceedings of HLT-NAACL 2004, 2004 - acl.ldc.upenn.edu
... For concreteness, in what follows we will refer to "sentences" rather than "text spans" since that 
is what ... state j to model digressions or unseen topics, we take the novel step of forcing its 
model to be ... Note that the contents of the "etcetera" cluster are ignored at this stage ... 

Offline recognition of unconstrained handwritten texts using HMMs and statistical language models 
A Vinciarelli, S Bengio, H Bunke - IEEE Transactions on Pattern ..., 2004 - computer.org
... were made to replace the words $w_i$ with corresponding classes $C(w_i)$ obtained through 
word clustering. ... the words, the system does not give a perfect transcription of the text, but allows 
the ... the first state of the first letter of the next word, then the language model term must ... 
M Woszczyna, A Waibel - International Conference on Spoken Language ..., 1994 - Citeseer
... 2. THE SPONTANEOUS SCHEDULING TASK All experiments have been performed on 
transcribed text taken from ... Unlike in cluster algorithms traditionally used for language modeling, 
each word can be in several 40 ... Using State-Dependent Monograms as Language Model (2 ... 
Topic segmentation with an aspect hidden Markov model 
DM Blei, PJ Moreno - Proceedings of the 24th annual international ..., 2001 - portal.acm.org
... A unigram language model is computed for each of these clusters and an appropriate smoothing 
technique is ... Note that this model requires a segmented corpus to train, but works in an 
unsupervised ... To segment a new document, the stream of text is divided into a sequence of ... 

Links between Markov models and multilayer perceptrons 
H Bourlard, CJ Wellekens - IEEE Transactions on Pattern Analysis ..., 1990 - computer.org
... maximization of $P(X \mid W_i, \lambda)$ implies that of its bilinear map (3). A language model provides the 
value ... are avoided when using discriminant models: namely, the lack of balance between the 
transition probability values (9) which only depend on the topology of the model and the ... 
J Makhoul, F Kubala, T Leek, D Liu, L Nguyen, R ... - Proc. IEEE, 2000 - vc.cs.nthu.edu.tw
... We are required to find all three sets of names but classify all others as general 
language (GL). Fig. 7 shows the hidden Markov language model used by 
IdentiFinder to model the text for each type of named entity. ... 

Combining optimal clustering and Hidden Markov models for extractive summarization 
P Fung, G Ngai, CS Cheung - ... of the ACL 2003 workshop on ..., 2003 - portal.acm.org
... Human Language Technology Center, Dept. ... We propose Hidden Markov models with 
unsupervised training for extractive summarization. ... Text cohesion is modeled by the transition 
probabilities of an HMM, and term distribution is modeled by the emission probabilities. ... 
H Ney, L Welling, S Ortmanns, K ... - IEEE ..., 1998 - kefallonia.telecom.tuc.gr
... Then the states are clustered together using a bottom-up-strategy until the desired number of 
states is reached. ... [2] K. Beulen, E. Bransch, H. Ney, "State Tying for Context Dependent Phoneme 
Models ... [8] H. Ney, S. Martin, F. Wessel, "Statistical Language Modeling Using ... 
M Creutz, K Lagus - Proceedings of the International and Interdisciplinary ..., 2005 - Citeseer
... As the set of stems is very large in a language, stems are not likely to be very ... From this point on, 
the full Categories-MAP model is used as it has been formulated mathematically above. ... Since 
there are transition probabilities, changes affect the context in which a morph occurs. ... 